
The present invention relates to a method of screening a protein for
involvement in cancer.

Cancer represents the second highest cause of mortality, after heart disease, in 
most developed countries. Current estimates suggest that one in three Americans
alive at present will suffer from some form of cancer.

Many different forms of cancer exist, and it is believed that there are many
different causes of the disease. Amongst the known causes of cancer are DNA
damage, for example as a result of exposure to carcinogenic chemicals or radiation,
and the actions of transforming viruses. It is further recognised that many cancers
result from aberrant gene expression, for example as a result of abnormal levels of
gene expression or of expression of mutated or otherwise altered gene products.

Although many methods for treating cancer exist, there is a well recognised
need to develop new and improved techniques. Selection of a suitable treatment for
cancer may be predicated on correct identification of the aetiology of the disease. For
this reason it is important to identify the cause of a given patient's cancer. In addition
to new treatments there is a requirement for new diagnostic tools able to detect
cancers.

It is believed that there are likely to be hundreds of genes the aberrant
expression of which is associated with cancer formation. Since the analysis and
modulation of such genes has the potential to form the basis of both methods of
diagnosis and treatment of cancers it is a recognised goal of biomedical research to
identify genes, and their products, involved in oncogenesis.

Current strategies for identifying such genes and proteins are primarily based
upon two strategies. The first is genomic sequencing, in which the entire genomes of
the most common cancers (such as breast cancer, colon cancer, lung cancer and
prostate cancer) are to be sequenced and compared with the normal human genome in
order to identify mutant genes which may play a role in the development of cancer.

The second approach is to identify those genes the expression of which is
altered in any particular cancer. This may be assessed by the analysis of expression
levels in samples of cancerous tissue from the individual compared to expression
levels in either normal tissue from the same individual or control samples taken from
individuals without cancer.

Both these strategies suffer from significant drawbacks. Analysis of
expression levels does not provide information regarding the presence or absence of
mutations within the genes being expressed. Strategies for comparative transcript
expression analysis, such as micro-array profiling, do not provide any information on
cancer associated point mutations and can prove problematic when dealing with gene
family members, which display areas of significant homology. The Human Cancer
Genome Project will clearly identify cancer-associated point mutations but neither of
the methodologies discussed will provide any direct functional annotation. Thus both
approaches will identify many changes that are not causal and are merely associated
with malignant disease.

Some companies currently use protein/protein interaction mapping in a
general approach using retroviral expression of random peptides as a platform
technology, however this technique still fails to take the functional perspective of the
proteins investigated into consideration.

According to a first aspect of the present invention there is provided a method
of screening a protein for involvement in cancer comprising:

i) exposing the protein to a first viral oncoprotein;

ii) assaying for interaction of the protein and the first viral oncoprotein;

iii) exposing the protein to a second viral oncoprotein; and

iv) assaying for interaction of the protein and the second viral oncoprotein

wherein interaction of the protein with the viral oncoproteins indicates that the protein
is involved in cancer.

In step i) of the invention the first viral oncoprotein is used as a "bait" to
identify proteins to which it is capable of binding (referred to as "prey"). A protein to
be screened is thus exposed to the first viral oncoprotein under conditions in which,
should the protein represent a target for the viral oncoprotein, the protein and viral
oncoprotein will be able to bind to one another. Such conditions may, for example, be
produced in a cell in which an interaction trap may be carried out.

Step ii) of the method allows the selection of those proteins screened that have
bound to the first viral oncoprotein, the targets of the oncoprotein, based upon the
interaction of the screened protein and the oncoprotein. This step may also be carried
out in a suitable interaction trap.

Steps iii) and iv) of the method represent repetitions of steps i) and ii)
respectively, save that in steps iii) and iv) the protein to be screened is exposed to the
second oncoprotein and the relative binding of the protein and second oncoprotein
assessed.

That a screened protein represents a binding partner for both the first and
second oncoproteins is taken to indicate that the protein in question is involved in
cancer.

It is preferred that those proteins that exhibit interactions with the first viral
oncoprotein are identified, for instance by sequencing, and their identities noted. The
same proteins may then be exposed to the second viral oncoprotein and the identities
of those proteins interacting with the second oncoprotein established. Comparison of
the proteins interacting with the first and second oncoproteins will allow the
production of "weighted" results in which those proteins interacting with the greatest
number of viral oncoproteins tested represent more favoured targets for further
investigation than those interacting with lesser numbers of oncoproteins.

In an alternative embodiment only those proteins that interact with the first
viral oncoprotein are exposed to the second viral oncoprotein and assayed for
interaction with that oncoprotein. In this case the proteins to be screened are exposed
to the viral oncoproteins sequentially, and after each assay for interaction with an
oncoprotein those proteins that do not exhibit interaction are excluded from the screen.
Thus in this embodiment of the invention only those proteins that interacted with the
first viral oncoprotein would be exposed to the second viral oncoprotein. Thus after
the two rounds of screening those proteins that remained within the screen would be
those that interacted with both oncoproteins used.
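
The sequential embodiment described above amounts to a repeated filtering of the candidate proteins. A minimal sketch in Python follows; the function name `sequential_screen`, the `interacts` callback and the example prey names and assay results are hypothetical illustrations and are not taken from the specification.

```python
from typing import Callable, Iterable, List

def sequential_screen(proteins: Iterable[str],
                      oncoproteins: List[str],
                      interacts: Callable[[str, str], bool]) -> List[str]:
    """Expose candidates to each oncoprotein in turn, excluding non-interactors."""
    remaining = list(proteins)
    for bait in oncoproteins:
        # Only proteins that interacted in this round stay in the screen.
        remaining = [prey for prey in remaining if interacts(prey, bait)]
    return remaining

# Hypothetical assay results: (prey, bait) pairs that gave a positive signal.
observed = {("preyA", "HPV16 E6"), ("preyA", "Tax"), ("preyB", "HPV16 E6")}

survivors = sequential_screen(
    ["preyA", "preyB", "preyC"],
    ["HPV16 E6", "Tax"],
    lambda prey, bait: (prey, bait) in observed,
)
print(survivors)  # ['preyA']
```

Adding further oncoproteins to the list corresponds to further rounds of screening.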

It will be readily appreciated that the possible number of "rounds" of
screening that may be performed will only be limited by the number of viral
oncoproteins available to a person effecting the invention.

Proteins identified by the screen as interacting with viral oncoproteins may be
investigated for other known interactions with viral oncoproteins. This may, for
example, be achieved by studying interactions reported in published literature or in
relevant databases.

It will be appreciated that a given protein that shows more than one
oncoprotein interaction according to the method of the invention will represent a good
candidate for further investigation and validation as set out below.

It will be recognised that, in the case of multiple rounds of screening with
different oncoproteins, those proteins that exhibit interactions with a greater number
of oncoproteins will represent better candidates for further investigation than those
proteins interacting with a lesser number of oncoproteins. The number of interactions
which a given protein is deemed to exhibit may take into consideration interactions
reported in, for example, published scientific literature, in addition to interactions
identified by means of, for example, screening using interaction traps.
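
The ranking of candidates by number of oncoprotein interactions can be sketched as a simple tally. The example below is illustrative only: the function `rank_candidates`, the prey names and the listed interactions are assumptions made for the sketch, and literature-reported interactions are simply pooled with those found in the screen.

```python
from collections import defaultdict

def rank_candidates(screen_hits, literature_hits=()):
    """Rank preys by the number of distinct oncoproteins they interact with."""
    partners = defaultdict(set)
    for prey, oncoprotein in list(screen_hits) + list(literature_hits):
        partners[prey].add(oncoprotein)  # a set, so repeats are counted once
    # Most interactions first; ties broken alphabetically for a stable order.
    return sorted(((prey, len(found)) for prey, found in partners.items()),
                  key=lambda item: (-item[1], item[0]))

# Hypothetical data: interactions found in the screen and in the literature.
screen = [("preyA", "HPV16 E6"), ("preyA", "Tax"),
          ("preyB", "Tax"), ("preyC", "Tax")]
literature = [("preyB", "SV40 large T"), ("preyB", "Tax"), ("preyB", "HPV16 E6")]

ranking = rank_candidates(screen, literature)
print(ranking)  # [('preyB', 3), ('preyA', 2), ('preyC', 1)]
```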

Preferably the cancer is a non-viral cancer.

The tissue from which the protein is derived is preferably a tissue having a
high proliferative potential, but in which the level of proliferation is low. An
indication of such a tissue type may be that its cells possess a relatively open
chromatin structure, which is itself an indication of promiscuous low level gene
expression characteristic of undifferentiated stem cells. It is preferred that the tissue
is selected from the group comprising placenta, cord blood haemopoietic stem
cells and foetal brain. Most preferably the protein is derived from placenta or cord
blood CD34+ haemopoietic stem cells.

The protein to be screened is preferably derived from a cDNA library.
Alternatively the protein may comprise a whole cell extract or selected proteins
expressed by a cell type of interest.

cDNA libraries suitable for use in the invention may be derived from any
mammalian tissue. It will be appreciated that the cDNA library is ideally derived
from human tissue when human genes involved in cancer are being screened.

cDNA libraries for use in the invention may be produced by any suitable
method known in the prior art. Examples of suitable methods that may be used for
the production of cDNA libraries are well known to those skilled in the art.

The exposure of the protein to be tested to the selected viral oncoproteins, and
the assessment of their interaction, is preferably performed as part of an interaction
trap. Many forms of interaction trap are known in the prior art, although it is
preferred to use a yeast two-hybrid interaction trap to put the invention into effect.

Yeast two-hybrid screening is a strategy for screening for proteins that interact
with a particular protein. Typically a cDNA library is constructed such that candidate
proteins are expressed as translational fusions with part of a reporter gene. Yeast cells
are then co-transfected with a "bait" construct consisting of the cDNA of interest
fused in-frame with the other part of the reporter gene. Only if both expressed
proteins physically interact will the two parts of the reporter gene be sufficiently
closely associated to generate a signal.

Most preferably the yeast two-hybrid interaction trap to be used may be
modified as described in Gyuris, J.; Golemis, E.; Chertkov, H. and Brent, R. "Cdi1, a
human G1 and S phase protein phosphatase that associates with Cdk2." (Cell. 1993
Nov. 19; 75 (4): pp. 791-803). The technique described in this paper is incorporated
herein by reference. It will, however, be appreciated by those skilled in the art that
any other form of interaction trap may be used to put the invention into practice.
Suitable examples include techniques such as mammalian two-hybrid, bacterial two-
hybrid or alternatively various types of pull down assay using, for example, an
immobilised hybrid of the bait protein fused to a tag protein such as glutathione
transferase (GST), or any other protein suitable for use for this purpose.

In order, when using a yeast two-hybrid interaction trap, to most efficiently
assay for those yeast cells in which both bait and prey proteins are present it is
necessary to selectively amplify the required yeast cells. This is independent of the
presence, or lack thereof, of interactions between the proteins of interest. In prior art
yeast two hybrid techniques it is usual to perform this selective amplification using
agar plates which incorporate a suitable selection medium. The total number of
transformed yeast cells are inoculated onto the plate which will only permit the
growth of those cells that contain both the cDNA derived protein and the target
protein. Such cells then produce viable colonies that are subsequently transferred to
other culture plates to allow assessment of interactions.

Whilst it is advantageous to incorporate such an amplification step into the
yeast two hybrid screen, there are notable drawbacks in prior art techniques in that it
is difficult and time consuming to transfer the amplified colonies from the plates
on which they have been amplified to the plates on which analysis of interactions will
be performed.





We have discovered that by performing the amplification step of the yeast two
hybrid screen in free solution of the suitable selection medium these disadvantages
may be overcome.

The amplification of the cDNA library in yeast in free solution provides
significant advantages both in terms of the ease and speed with which the yeast cells
that have been positively transformed can be handled prior to plating. Furthermore
there is an increase in the efficiency of recovery of the amplified yeast cells from the
culture.

Previously published methods of performing yeast two hybrid interaction
screening use an inoculating loop to directly replica transfer each single colony of
cells from the master screen plate to each of the plates used for growth. We have
found that by spotting a defined aliquot of a colony re-suspended in a suitable diluent,
such as sterile distilled water, rather than simple replica plates produced by direct
colony transfer from the master plate, a defined number of cells are transferred in a
precise volume per spot. The consequence of this is that subsequent growth is more
uniform and can be compared in a much more precise manner.

Oncoproteins that may be used according to the present invention may be
selected from the oncoproteins expressed by any transforming virus. Preferably the
first and second oncoproteins to be used according to the present invention are
selected from the group comprising oncogenic human papilloma virus (HPV), such as
type 6, 16 and 18, E6 and E7 proteins, hepatitis B "X", hepatitis C "Core", SV40
large "T", adenovirus "E1A", human T lymphotrophic virus types 1 and 2 "Tax",
Epstein Barr virus "LMP" and "EBNA3". The selected oncoproteins may be used
either singly or in combinations as described above.

Preferably the first viral oncoprotein comprises HPV 16 E6.
Preferably the second viral oncoprotein comprises Tax.

In a preferred modification of the invention a protein, identified by the screen
as being involved in cancer, may further be exposed to another (second) protein, or
proteins, and an assay conducted for interactions between the two proteins. In such a
modification proteins identified by their interaction with viral oncoproteins according
to steps i) to iv) of the first aspect of the invention represent "anchor protein targets"
that are then used to identify "secondary protein targets". The secondary protein
targets identified in this case are those examples of the second protein that interact
with the anchor protein target. Thus dissection of the interactions of an anchor
protein target involved in cancer enables identification of further secondary protein
targets that may also be involved in cancer and may represent suitable targets for
further investigation as set out below.

Such secondary protein targets may, for instance, represent upstream or downstream
members of intracellular protein networks in which the anchor protein target is
involved. The interaction of the viral oncoproteins with the anchor protein target
identifies that the oncoproteins are able to influence the activity of the network and
hence, indirectly, the activity of the secondary protein target. Examples of such
networks include signalling pathways and pathways influencing gene regulation.
Proteins identified in this way will represent targets for further investigation as
modulators or markers of cancer.

A protein identified as an anchor protein target represents a suitable target for
future investigation with respect to its involvement in cancer. However the use of
these anchor protein targets as a means by which further secondary protein targets
involved in cancer may be identified is a great benefit provided by the invention.

So important is this benefit that according to a second aspect of the invention
there is provided a method of screening a protein sample for proteins that are
secondary protein targets for viral oncoproteins comprising:

v) exposing an anchor protein target identified as a protein involved in cancer
according to the first aspect of the invention to the protein sample; and

vi) assaying for interaction of proteins within the sample with the anchor protein;

wherein proteins identified by their interaction with anchor protein targets in step vi)
represent secondary protein targets involved with cancer.

Protein samples used according to the second aspect of the invention may
preferably be derived from the same tissue as the protein identified as being involved
in cancer. Interactions between the proteins may be tested and assessed by means of
an interaction trap, preferably by means of an interaction trap as described above.

Those proteins that are selected by any aspect of the invention as potentially
involved in oncogenesis may then be sequenced in order that their identity may be
discovered. Plasmids encoding the protein of interest may be extracted from cells
used in the interaction trap and their sequence determined by any suitable sequencing
method. Suitable means by which the plasmids may be extracted and the sequence
information obtained will be readily apparent to those skilled in the art.

Knowledge of the sequence of a protein of interest, or of the gene encoding
the protein, will allow searches of relevant databases to be undertaken in order to
establish, where possible, the identity of the protein or gene. This identity
information may then be used to investigate other reported interactions of the protein
in question and also establish the function of the protein. Information regarding
known interactions may allow the identification of other members of functional
pathways as set out above. Information about the function of the protein may be
useful in identifying the possible mode of action of the protein in oncogenesis, or in
suggesting suitable means by which activity of the protein may be modulated, for
instance by known modulators of a class of proteins of which the protein of interest
may be a member.

A preferred embodiment would be the identification of a novel interaction of
any secondary protein target with any of the viral oncoproteins listed, thus picking out
the particular anchor/secondary protein target pathway as being the target of multiple
viral insults. Ideally the new secondary protein target would also be present in the
newly acquired catalogue of anchor protein targets.

Bio-informatic analysis of proteins identified according to the first or second
aspects of the invention and also of pathways identified by the screen as being
involved in cancer may be carried out as follows. Genes extracted from plasmids
encoding anchor or secondary protein targets can be identified by the use of BLAST
search of the non-redundant and "Expressed Sequence Tag" (EST) NCBI nucleotide
databases. This strategy can also be used to identify other gene family members.
Expression profiles can be evaluated in silico by the use of SAGE tag virtual Northern
Blots. Information regarding reported functions and known interactions of targets
identified by the screen as involved in cancer may be contained in scientific
publications or other public domain databases. These can be accessed via PubMed,
UmgpDB Gene Cards as can information regarding the chromosomal location of
targets. Chromosomal location can also be ascertained by use of the Human Genome
Gateway BLAT search program which provides information regarding intron/exon
boundaries, gene structure, identity of adjacent genes and available ESTs together
with access to gene prediction programs such as ENSEMBL. Chromosomal locations
can be evaluated for cancer associated amplifications/deletions etc. by the use of the
PubMed database. The PubMed database can also be used to identify any secondary
protein targets previously shown to interact with an anchor protein target under
investigation. For example searches can be performed incorporating the name of the
prospective target and that of a suitable interaction trap method. Alternatively the
names of additional viral oncoproteins can be incorporated as search terms along with
the names of either anchor protein targets or secondary protein targets identified using
specific oncoprotein baits, or the genes encoding such targets. Using PubMed all
prospective "anchor" and "secondary" targets can be cross referenced with the broad
functional names of categories of protein that are known to be responsive to the action
of commonly available "drugs". Examples of classes of target proteins for which a
range of existing drugs are available include proteases, kinases, phosphatases, ion
channels etc.
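
The cross-referencing of prospective targets against broad functional categories might be sketched as a keyword match on each target's functional annotation. This is a minimal illustration only: the function `druggable_classes`, the simple substring matching and the example annotations are assumptions, and the class list merely repeats the examples named in the text.

```python
# Protein classes named in the text as having a range of existing drugs.
DRUGGABLE_CLASSES = ("protease", "kinase", "phosphatase", "ion channel")

def druggable_classes(annotation: str):
    """Return the drug-responsive classes mentioned in a functional annotation."""
    text = annotation.lower()
    return [cls for cls in DRUGGABLE_CLASSES if cls in text]

# Hypothetical annotations, e.g. as retrieved from a database search.
annotations = {
    "target1": "Serine/threonine protein kinase",
    "target2": "Dual specificity phosphatase",
    "target3": "Transcription factor",
}

flagged = {name: druggable_classes(text) for name, text in annotations.items()}
print(flagged)
```

Targets returning a non-empty list would, under this sketch, be the ones for which existing classes of modulators could be considered first.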

Homology comparison of two sequences can be carried out using pairwise
BLAST (NCBI). If the gene, and hence target protein, function is unknown this can
be evaluated by using protein-protein cross species homology searches to such
organisms as C. elegans, D. melanogaster, S. cerevisiae, or M. musculus. Cross
species interlogs can also be used to identify additional potential interactors within
common pathways.

Knowledge of the function of a protein of interest may represent another
criterion by which favoured targets for further investigation or validation may be
identified. For example it may be preferred to investigate those proteins of a certain
class, or for which known modulators exist, as a matter of priority.

Information regarding the sequence of plasmids encoding those proteins
identified by the interaction trap as potentially of interest is also useful in discounting
coding sequences that may give "false positive" results. Such coding sequences may,
for example, be those located in untranslated regions or other genomic contaminants.

In a preferred embodiment the method of the invention further comprises a
validation step. Proteins identified by the method of the invention as involved with
cancer, or the nucleic acids encoding such proteins, represent suitable "targets" for
such validation.

Such a validation step may, for example, comprise analysis of expression
levels of targets identified by the screen. Analysis of levels of expression of
identified targets, and also of mutated forms of identified targets, may preferably be
conducted by means of comparing the degree of expression of gene products, and the
identity of the products expressed, between samples of cancerous and non-cancerous
tissues. Preferably such tissue samples will be derived from the same tissue type,
and most preferably the cancerous and non-cancerous tissue samples will be derived
from the same individual.

Analysis of expression of targets identified by the method of screening of the
invention may also include analysis of expression of mutant forms of targets
identified. By "mutant forms" we mean any proteins having at least 50% sequence
homology (in the sequence of the amino acids forming the protein or of the nucleic
acids encoding the protein) with the targets identified by the screening method.
Preferably mutant forms of the target will share at least 75% homology with the wild-
type target, and most preferably mutant forms of the target share at least 90%
homology with the wild-type target gene or gene product.
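
The 50%, 75% and 90% thresholds can be illustrated with a short calculation. The sketch below assumes, purely for illustration, that "homology" is measured as percent identity over two sequences that have already been aligned to equal length; the specification does not fix a particular measure, and the function names and example sequences are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of aligned positions carrying the same residue (gaps never match)."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be pre-aligned, non-empty and of equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)

def mutant_form_tier(identity: float) -> str:
    """Map a percent identity onto the preference tiers given in the text."""
    if identity >= 90:
        return "most preferred"
    if identity >= 75:
        return "preferred"
    if identity >= 50:
        return "mutant form"
    return "not a mutant form"

wild_type = "MKTAYIAKQR"  # hypothetical aligned sequences
variant_1 = "MKTAYIAKQL"  # one substitution in ten positions
variant_2 = "MKTAAAAKQL"  # three substitutions in ten positions

print(percent_identity(wild_type, variant_1), mutant_form_tier(90.0))  # 90.0 most preferred
print(percent_identity(wild_type, variant_2), mutant_form_tier(70.0))  # 70.0 mutant form
```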

In an alternative validation step mutant forms of nucleic acids encoding target
proteins identified by the screening method may be evaluated and analysed by any
suitable technique known in the prior art. For example, such analysis may be
performed by multiplex PCR. Multiplex PCR may, for example, be performed on a
real time multiplex PCR machine. Alternatively RNA expression and altered splice
forms can also be evaluated by Northern Blots. Comparison of expression levels can
be carried out using matched pair tumour/normal total cDNA array blots such as those
produced by Clontech.

A suitable validation step may also comprise analysis of the presence of point
mutations in the genes encoding proteins identified by the screening method. A
suitable method by which such an investigation may be carried out is by assessment
of denaturing HPLC (Transgenomic Wave analysis) studies comparing cDNA
amplification products derived from cancerous and non-cancerous tissue samples of
the type indicated above. Other sequencing methods known in the art are also
suitable for use in analysis of putative mutations in genes encoding target proteins.

Mutations of genes encoding targets identified by the screening method of the
invention may also be detected by analysis of the genomic DNA from which the
cDNA referred to above are derived.



Another parameter that may be investigated in any validation step used may be
"allelic loss" or "homozygous loss" among genes encoding target proteins identified
by the screen of the invention. Homozygous loss occurs when a specific gene has
deletions present in both alleles. Homozygous loss can be detected by PCR of either
genomic or cDNA. Other suitable techniques that may be employed are Northern or
Southern blotting. Allelic loss occurs when only one allele of a gene is deleted.
Allelic loss can be demonstrated by analytical PCR comparison of genomic DNA
from normal and tumour tissues using an informative single nucleotide polymorphism
(SNP), identified from the public database, or a designed SNP assay. Allelic loss is
indicated by the presence of a restriction fragment polymorphism in one allele and not
the other.
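
The distinction between allelic loss and homozygous loss can be expressed as a comparison of the alleles detected at an informative SNP in normal and tumour DNA. The following is a simplified sketch under stated assumptions: the function `classify_loss`, its return labels and the single-letter allele encoding are hypothetical, and real assays would also need to allow for contaminating normal tissue and failed amplification.

```python
def classify_loss(normal_alleles, tumour_alleles):
    """Compare alleles detected at one SNP in matched normal and tumour DNA."""
    normal, tumour = set(normal_alleles), set(tumour_alleles)
    if not normal:
        raise ValueError("no alleles detected in the normal sample")
    retained = normal & tumour
    if not retained:
        return "homozygous loss"   # neither allele detected in the tumour
    if len(normal) < 2:
        return "uninformative"     # normal tissue is not heterozygous at this SNP
    if len(retained) < len(normal):
        return "allelic loss"      # one of the two alleles is missing in the tumour
    return "no loss"

print(classify_loss("AG", "A"))   # allelic loss
print(classify_loss("AG", ""))    # homozygous loss
print(classify_loss("AG", "AG"))  # no loss
print(classify_loss("AA", "A"))   # uninformative
```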

Validation of the interactions and properties of both target proteins and their
coding genes may also be investigated in vitro or in vivo.

For example genes encoding targets may be cloned into vectors expressing
detectable "tags" such as the pcDNAV5His vector. Expression of a gene of interest
using this vector causes the protein to be produced bearing a small antigenic "tag"
protein V5 that facilitates the immuno-localisation of the target protein. Thus
expression of targets in conjunction with such tag proteins is particularly suited to the
use of immuno-precipitation studies to investigate the interactions of the target protein
and other molecules, such as viral oncoproteins or other members of putative
pathways, in cultured cells.

Proteins identified according to the first or second aspect of the invention as
being involved with cancer, that is to say anchor protein targets and secondary protein
targets, represent targets for therapeutic or diagnostic intervention.

Further information as to the effect of over-expression of a protein identified
according to the first or second aspects of the invention, or alternatively blockade of
its expression, may be obtained by, for example, gene transfer experiments or the use
of anti-sense oligonucleotides. Such experiments may preferably be undertaken in
transformed or non-transformed cell lines depending on the activity that it is sought to
assess. Suitable means by which gene expression may be induced include the
Invitrogen GeneSwitch system in which induction of gene expression may be brought
about by exposure to mifepristone. In vivo analysis of the effects of expression of
target proteins and their possible interactions may be effected using well known
techniques such as in vivo tumour xenograft models.

A protein that is indicated, by the methods of the invention, to be involved
with cancer may be selected as a target for modulation for therapy for cancer, or as a
marker for the diagnosis of cancer.

Once targets have been identified by means of the screening method of the
invention then compounds that may be used to provide novel cancer therapies based
upon the manipulation of levels of the target may be found. Such compounds may,
for instance, be compounds that influence the level of transcription of the gene
encoding the target, or influence the accumulation or bio-availability of the target.

The compounds may be used to treat cancer as a monotherapy (i.e. use of the
compound alone) or in combination with other compounds or treatments used in
cancer therapy (e.g. chemotherapeutic agents, radiotherapy).

Known procedures, such as those conventionally employed by the
pharmaceutical industry (e.g. in vivo experimentation, clinical trials etc), may be used
to establish specific formulations of medicaments and precise therapeutic regimes
(such as daily doses of the agents and the frequency of administration) that may be
used to influence the expression of identified targets and thereby provide novel
therapies for cancers in which the target is implicated.

Such novel therapies may be used for the purposes of treating an existing
cancer or may be used as a prophylactic treatment administered to a person believed
to be at risk of developing such a cancer.

Once targets have been identified it will be appreciated that such targets may
be used as the basis for methods for the diagnosis of cancer. Such methods may, for
example, comprise analysing a cell sample from a patient for the presence of a mutant
form of the target and/or altered expression of the wild type target.

Diagnosis may be effected on a sample from a patient believed to be suffering
from cancer, or alternatively from a patient believed to be at risk of developing cancer.

Diagnosis according to the invention may be effected in order to establish
whether or not a patient is suffering from cancer. As an alternative diagnosis may be
effected in order to assess the suitability of a specified therapeutic regime for the
treatment of a patient's disease.

The sample taken may, for instance, be a tissue biopsy, blood sample or swab.

Diagnosis of the cellular levels of either the wild-type or mutant forms of the
target may be carried out by assessing the level of the product of the target within the
cell, or alternatively by taking a measurement of the level of transcription of the gene
encoding the target. The expression of wild-type or mutant products of the target
may, for example, be assessed through the use of specific binding agents including
polyclonal and monoclonal antibodies in techniques such as immuno-cytochemistry,
immuno-precipitation, immuno-blotting (Western blotting) or enzyme linked
immuno-sorbent assay (ELISA).

Detection of the wild type or mutant form may be directed to either detection of
the protein, or detection of the genetic material encoding the wild-type or mutant
protein.

A preferred experimental protocol for effecting the first aspect of the invention is
as follows:

Proteins encoded by a suitable cDNA library, such as one derived from placenta,
are screened by a yeast two-hybrid methodology (Gyuris et al., supra) with selected
oncoproteins, such as HPV 16 E6 and "Tax", as both 5' and 3' LexA fusions. A Qbot
(Genetix) is used to facilitate the screening process. The results of this screen are a 96
well format panel of yeast clones, which express the putative protein prey for each
viral oncoprotein. Duplicate multiwell plates are inoculated and screened by
blue/white X-Gal selection for interactions between the proteins being screened and
the viral oncoproteins.

Sequencing of genes encoding proteins identified by the method of the invention
as being involved in cancer:

Colonies exhibiting positive interactions between proteins being screened and the
viral oncoproteins are picked into 2 ml deep well 96 well plates, expanded in culture
and cDNA "prey" inserts amplified from a small aliquot of the culture by the use of
vector flanking primers. All preys are PCR sequenced by use of an eight-column
CEQ2000 sequencer and cDNA inserts identified by BLAST searches.

Bioinformatics analysis of genes encoding proteins identified by the method of
the invention as being involved with cancer:

This provides the identity of all candidate genes, thereby allowing identification
of those potential "anchor" target preys that are common to more than one viral
oncoprotein and allows grouping into categories according to gene function by the use
of bio-informatics (e.g. NCBI database, PubMed, Human Genome Gateway etc.). It
also allows elimination of: sibling clones derived from the same parent cDNA; clones
derived from non-coding sequence; and out of frame sequence. Those protein preys
common to more than one oncoprotein are the first to be selected for target validation
studies. Some protein preys may not directly interact with more than one oncoprotein
but may represent strong candidates since there may be alternative "secondary"
targets identified within a common pathway that interact with additional viral
oncoproteins to produce a similar oncogenic effect. Thus it is also important to
establish the possibility of multiple viral oncoprotein involvement in common cellular
pathways. The result of this analysis is the identification of putative protein targets
within an interaction cascade from the protein identified according to the first aspect
of the invention.

A preferred protocol for effecting the second aspect of the invention is as
follows:

The steps i) to iv) identified above were performed to identify anchor protein
targets for use in the second aspect of the invention. The following further steps are
also performed.

Second round yeast two-hybrid screens may be carried out using selected anchor
protein targets as bait. The products of these screens represent potential new
secondary protein targets, which are then compared to the newly identified catalogue
of anchor protein targets of the different viral oncogenes studied. These secondary
protein targets are also functionally evaluated by the use of bio-informatics for in
silico interaction mapping. On the basis of these findings, targets are then prioritised
for further validation studies according to "druggability" (e.g. ion channels, kinases,
proteases etc.) and the number of viral oncogenes that target a particular interaction
cascade. If the function of any novel target is unknown it is possible to gain clues to
this by examining the databases of other species such as C. elegans, S. cerevisiae, D.
melanogaster etc. for homologous sequences since there may be functional data on
these homologues. Thus a matrix of weighted results of those cellular proteins and
their pathways and the number of viral oncoproteins that interact with the pathway is
produced.
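
The prioritisation described above can be illustrated with a short Python sketch. The text names the two criteria (druggability class and the number of viral oncoproteins targeting a cascade) but gives no numeric scheme, so the weights, the `Target` record, the function names and the example targets below are all hypothetical assumptions chosen only to show how such a weighted ranking might be computed.

```python
from dataclasses import dataclass

# Hypothetical weights: the description names the criteria but not their values.
DRUGGABLE_CLASSES = {"ion channel", "kinase", "protease"}
DRUGGABILITY_WEIGHT = 2.0
ONCOPROTEIN_WEIGHT = 1.0


@dataclass(frozen=True)
class Target:
    name: str
    protein_class: str
    oncoproteins: frozenset  # viral oncoproteins that hit this target's cascade


def priority_score(target):
    """Score a target by oncoprotein count plus a bonus for druggable classes."""
    score = ONCOPROTEIN_WEIGHT * len(target.oncoproteins)
    if target.protein_class in DRUGGABLE_CLASSES:
        score += DRUGGABILITY_WEIGHT
    return score


def rank_targets(targets):
    """Return targets ordered from highest to lowest priority score."""
    return sorted(targets, key=priority_score, reverse=True)


# Illustrative, made-up targets (not results from the described screens).
targets = [
    Target("target_1", "kinase", frozenset({"E6", "Tax"})),
    Target("target_2", "unknown", frozenset({"E6", "Tax", "E7"})),
    Target("target_3", "protease", frozenset({"Tax"})),
]

for t in rank_targets(targets):
    print(t.name, priority_score(t))
```

Any real weighting would need to be set from the validation data; the sketch only shows the shape of the calculation.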

Validation steps that may be used according to the first and second aspects of the
invention are as follows:

Validation Phase One:

Selected "anchor" and "secondary" target genes are entered directly into a detailed
analysis of transcript expression (Multiplex PCR), mutation (Denaturing HPLC,
Sequencing), allelic loss etc. in a variety of human cancers and corresponding normal
tissue types isolated, wherever possible, from the same patient. These data also
provide a basis for distinguishing between targets that are associated with viral life
cycle and those that are associated with oncogenesis. This phase of the analysis may
be performed using a real time multiplex PCR machine such as the MX4000
(Stratagene).



Validation Phase Two:

Based on validation phase one findings, selected targets are entered into a program of
studies using a variety of cell culture systems. Cloning into "Tag" vectors, transient
transfection and immuno-precipitation are used to confirm oncogene-target interaction
in mammalian cells. Gene transfer and the use of anti-sense oligonucleotides are used
to assess the effects of either over expression or blocking expression of selected
targets in different transformed and non-transformed cell lines. pSwitch transformed
and non-transformed cell lines may be used with the Gene Switch (Invitrogen)
mifepristone inducible system for evaluating the effects of stable induced expression
of targets in vitro. The same system can be used for controlled gene expression with a
murine in vivo tumour xenograft model.

The invention will now be described, by way of example only, with reference
to the accompanying example and figure in which:

Figure 1 represents the results of probing a panel of pair-matched cDNAs from
normal and tumour tissue using radioactively labelled TIP-1 coding sequence.

Yeast two-hybrid selection, as described by Gyuris et al., supra, was used to identify
proteins derived from a human placenta cDNA library that bound to the viral
oncoproteins human papilloma virus type 6 E6 and human T-cell leukaemia virus
"Tax".

Yeast cells expressing proteins derived from the placenta cDNA library were first
exposed to HPV 6 E6 and those clones expressing human proteins that interacted with the
oncoprotein noted. The same panel of yeast cells expressing the cDNA library were
then assayed for interactions with "Tax", and those clones that interacted with the
oncoprotein noted. Comparison of the lists of interacting clones produced allowed those
clones that encoded human proteins that reacted to both oncoproteins to be identified.
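
The comparison of the two lists of interacting clones amounts to a set intersection over clone identifiers. A minimal Python sketch follows, assuming each screen yields a list of positive clone identifiers; the identifiers and the function name `common_interactors` are hypothetical and are not part of the described protocol.

```python
def common_interactors(hits_by_oncoprotein):
    """Return clone IDs that scored positive against every oncoprotein screened."""
    hit_sets = [set(hits) for hits in hits_by_oncoprotein.values()]
    if not hit_sets:
        return set()
    return set.intersection(*hit_sets)


# Hypothetical positive clones from the two screens (illustrative IDs only).
screens = {
    "HPV 6 E6": ["clone_A01", "clone_B07", "clone_C03"],
    "Tax": ["clone_B07", "clone_D11", "clone_C03"],
}

shared = sorted(common_interactors(screens))
print(shared)
```

The same function accepts any number of screens, so it would also cover a panel extended to further viral oncoproteins.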

The gene encoding one of those placenta derived proteins that exhibited interactions with
both viral oncoproteins was sequenced after plasmid extraction and found to have a
sequence as follows (Sequence ID No. 1):

1 AGGGGCGCTC CX3GCCAGTGA TTOGCTGGAG GTXTGTTAAC 
TATTCATGAG GGGGCGGOCC 

61 GAGCGGGGCG GCCTTTGTTA AGCAGCGAGG GCGCQACCOC 
GGGTACTCTG CTGCCGGCTT 

121 CTCGGAGCGG CGCTGGGCGA CCAQAGCAGQ GTCGAQATGT 
CCTACATCCC GGGCXJAGCCG 

181 GTCACCGCCG TGCTGCAAAG AGTTOAAATT CACAAGCTC^ 
GTCAAGGTGA GAACTTAATC 

241 CTGGGTTTCA GCATTGGAQG TGQAATCGAC CAGGACXJCTT 
CCCAGAATCC CTTCTCTGAA 

301 GACAAGACXW ACAAGGQTAT TTATGTCACA CGGGTGTCTG 
AAGQAGQCCC TGCTGAAATC 

361 GCTGGGCTGC AGATTGGAGA CAAGATCATG CAGQTGAACQ 
GCrGGGACAT GACCATGGTC 

421 ACACAOQACC AGGCCCGCAA GOGGCTCACC AAQCGCTCGG 
AGGAGQTGGT GCGTCTGCTG 

481 GTQACGCGGC AGTCGCTGCAGAAGGCCG?rGCAGCAGTCCA 
TGCTQTCCTA GCAGCCACC A 

541 CCATCTGCGA CTCCTGCCTG CCGCCTCTCT GTACAGTAAC 
GCCACTTCCA CACTCTGTCC 

601 CCATCTGGCT TCTGCTGACC GCTGGGCCCC ACCTCAGAAG 
GGCTATAGCT GOTCCCAG AG 

661 GCCTGGCCTG GCCTTCdTC CCTTCTCCCA TOCCTGGCCT 
GGGGCCTCTG GGACCAGCTT 

721 TCTCTCCTGG ACACCGAGGA TTGGAAATAA GGGCCTGGAG 
CTG AGTAQTA GCC AGTCTGC 

781 TGTGACCACA GGCTCAGGTC CGACCCTGCT GCTTGGCCAC 
AGCAGTGGCT GGGCAAGTGG 

841 GAACCACTAT CTCTTGGGAG a:C«CAAAAG CTQGGAAATG 
CTGOAGOAAC CAGGCCTTTC 

901 CCGCTTTTGC CTQGCTGCAG GGTTCGGCTC CX3CCCCTGCC 
CCC3CAGCOCT OGTGTGTCX; A 

961 CACCGCAGIG CXrrCTGCCCC TCGGGGGACT GGACACACAT 
CCTQCCAGAG GCGCTACXSAA 

1021 GCTTTCCCCA QATGAAQCGA GGTGGGCTCC QCGTTCACTC 
CCACTCTCCC GAGGGOTGCT 

1081 GGCCrrCCCCA gggtttgoct tcttacggat ttagacgagg 
TTCGAGGCTC ACCTATCAGG 

1141 GCAGCrcrcA ggattgtcat TTTCCTcrrr gcctgtgggt 
ttaacttttg tattttttta 

1201 ATCACAAGTT TGATACAAAA TGTmTATC GTACTCTTTQ 
GAGATGCCCA TTCTACmT 

1261 GAATTTAGCT TTTACTAATT CGCATCTGGA AGCTCAGCAA 
GTGCACAAGC CTTACTTTGG 

1321 TTACCGTGGA AACCACTGCC GCCCCTCCCC GATQTGGTGC 
GC?rCAATAAA AATGCTGQAA 

1381 TTCAAAAAAA 

Comparison of Sequence ID No. 1 with publicly available database information revealed
that Sequence ID No. 1 corresponded to the published sequence of the gene (Accession
number AF028823) encoding the HTLV "Tax" interacting protein 1 (TIP-1).

In order to validate the method and identify that TIP-1 is indeed involved in cancer,
full length TIP-1 coding sequence was radioactively labelled by random priming with α
32P dCTP and used to screen a panel of 250 matched pairs of normalised total cDNAs
from normal (left hand column dots in Figure 1) and tumour tissue (right hand column
dots in Figure 1) from the same individual (Clontech) (see Figure 1). This blot covers a
wide range of human cancer types and the results indicated that the expression of TIP-1
was up-regulated approximately ten fold in: 28% of (n = 14) ovarian carcinoma, 36% of
(n = 50) breast carcinoma, 33% of (n = 21) lung carcinoma, 30% of (n = 42) uterine
carcinoma and 50% of (n = 6) thyroid carcinoma. However, analysis of the histology
from these various carcinomas indicated that different carcinomas, such as lung, had
sub-classifications within the overall category. Out of 21 lung carcinomas 5 were
keratinising, of which none showed any up-regulation of TIP-1. Out of the remaining 16
lung carcinomas, 8 (50%) showed extensive up-regulation of TIP-1 expression.
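
The subgroup figure for lung carcinoma can be checked with a few lines of Python. The counts (21 lung carcinomas, 5 keratinising, 8 up-regulated among the remainder) are taken from the paragraph above; the function name `upregulation_rate` is a hypothetical helper introduced only for this illustration.

```python
def upregulation_rate(upregulated, total):
    """Fraction of samples in a group showing up-regulation."""
    if total <= 0:
        raise ValueError("total must be positive")
    return upregulated / total


total_lung = 21
keratinising = 5
non_keratinising = total_lung - keratinising  # the remaining 16 carcinomas

rate = upregulation_rate(8, non_keratinising)
print(f"{non_keratinising} non-keratinising lung carcinomas, {rate:.0%} up-regulated")
```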



These results are consistent with TIP-1 being involved in cancer. The fact that TIP-1
was identified by following methods according to the first aspect of the invention
validates the claimed method.

CLAIMS



1. A method of screening a protein for involvement in cancer comprising:

i) exposing the protein to a first viral oncoprotein;

ii) assaying for interaction of the protein and the first viral oncoprotein;

iii) exposing the protein to a second viral oncoprotein; and

iv) assaying for interaction of the protein and the second viral oncoprotein;
wherein interaction of the protein with the viral oncoproteins indicates that the protein is
involved in cancer.

2. The method according to claim 1, wherein only those proteins that interact with the
first viral oncoprotein are exposed to the second viral oncoprotein.

3. The method according to any preceding claim, wherein the protein is derived from a
tissue having a high proliferative capacity but low proliferation.

4. The method of claim 3, wherein the tissue is selected from placenta or cord
blood CD34+ haemopoietic stem cells.

5. A method according to any preceding claim wherein those proteins that
exhibit interaction with a viral oncoprotein are correlated with interactions with other
oncoproteins.

6. A method according to any preceding claim, wherein the first and second viral
oncoproteins are selected from the group comprising human papilloma virus type 6,
16 and 18 E6 and E7 proteins, hepatitis B "X", hepatitis C "Core", SV40 large "T",
adenovirus "E1A", human T lymphotrophic virus types 1 and 2 "Tax", Epstein Barr
virus "LMP" and "EBNA5".



7. A method according to claim 6, wherein the first oncoprotein is HPV 16 E6.

8. A method according to claim 6 or claim 7, wherein the second oncoprotein is
"Tax".

9. A method according to any preceding claim, wherein the protein is derived
from a cDNA library.

10. A method according to any preceding claim additionally comprising a
validation phase.

11. A method according to claim 10, wherein the validation phase comprises the
further steps of:

a) analysing expression levels of a protein, indicated as being involved in cancer, in
cancerous and non-cancerous tissue samples; and

b) comparing the levels of expression of the protein in the cancerous and non-cancerous
tissues.

12. A method according to claim 10, wherein the validation step comprises analysis
of nucleic acids derived from a tissue sample for the presence of mutant forms of the
nucleic acids encoding a protein indicated as being involved in cancer.

13. A method according to claim 10, wherein the validation phase comprises analysis
of DNA derived from a tissue sample for the presence of mutant forms of the DNA
encoding a protein indicated as being involved in cancer.

14. A method according to any preceding claim, further comprising:

v) exposing a protein identified as a protein involved in cancer according to claim 1
to a protein sample; and

vi) assaying for interaction of proteins within the sample with the protein identified
as involved in cancer according to claim 1;

wherein proteins identified in step vi) are secondary protein targets involved with
cancer.

15. A method according to claim 14, wherein the second protein is derived from the
same tissue as the protein exposed to the viral oncoproteins.

16. A method according to any preceding claim, wherein the protein indicated to be
involved with cancer is selected as a target for modulation for therapy for cancer.

17. A method according to any preceding claim, wherein the protein indicated to be
involved with cancer is selected as a marker for the diagnosis of cancer.

18. A method of screening a protein sample for proteins that are secondary protein
targets for viral oncoproteins comprising:

i) exposing a protein identified as a protein involved in cancer according to claim
1 to the protein sample; and

ii) assaying for interaction of proteins within the sample with the anchor protein;

wherein proteins identified by their interaction with the protein involved in cancer in step
ii) represent secondary protein targets involved with cancer.